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4 <110> APPLICANT: SANTI, Daniel 

5 PECK, Lawrence 

6 KEALEY, James 

7 DAYEM, Linda 

9 <120> TITLE OF INVENTION: Heterologous Production of Polyketides 

12 <130> FILE REFERENCE: 010025.01 

14 <140> CURRENT APPLICATION NUMBER: 09/699, 136A 

15 <141> CURRENT FILING DATE: 2000-10-27 

17 <150> PRIOR APPLICATION NUMBER: 60/161,703 

18 <151> PRIOR FILING DATE: 1999-10-27 

20 <160> NUMBER OF SEQ ID NOS : 6 [C 

22 <170> SOFTWARE: FastSEQ for Windows Version 4.0 C 

24 <21.0> SEQ ID NO: 1 

25 <211> LENGTH: 1917 

26 <212> TYPE: DNA 

27 <213> ORGANISM: Propionibacteriuiti f reudenreichii ssp . shermanii 

29 <400> SEQUENCE: 1 

30 atgagcagca cggatcaggg gaccaacccc gccgacactg acgacctcac tcccaccaca 



Ti 



60 



31 ctcagcctgg ccggggattt ccccaaggcc actgaggagc agtgggagcg cgaagttgag 120 

32 aaggtactca accgtggtcg tccaccggag aagcagttga ccttcgccga gtgtctgaag 180 

33 cgcctgacgg ttcacaccgt cgatggcatc gacatcgtgc cgatgtaccg tccgaaggac 240 

34 gccccgaaga agctgggtta ccccggcgtc gcacctttca cccgcggcac cacggtgcgc 300 

35 aacggcgaca tggatgcctg ggacgtgcgc gccctgcacg aggatcccga cgagaagttc 360 

36 acccgcaagg cgatcctcga aggcctggag cgtggcgtca cctccctgtt gctgcgcgtt 420 

37 gatcccgacg cgatcgcacc cgagcacctc gacgaggtcc tctccgacgt cctgctggaa 480 

38 atgaccaagg tggaggtctt cagccgctac gaccagggtg ccgccgccga ggccctggtg 540 

39 agcgtctacg agcgctccga. caagccggcg aaggacctgg ccctcaacct gggcctggat 



600 



40 



840 
900 



cccatcgcgt tcgcagccct gcagggcacc gagccggatc tgaccgtgct cggtgactgg 660 

41 gtgcgccgcc tggcgaagtt ctcgccggac tcgcgcgccg tcacgatcga cgcgaacatc 720 

42 taccacaacg ccggtgccgg cgacgtggca gagctcgcct gggcactggc caccggcgcg 780 

43 gagtacgtgc gcgccctggt cgagcagggc ttcaccgcca ccgaggcctt cgacacgatc 

44 aacttccgtg tcaccgccac ccacgaccag ttcctcacga tcgcccgtct tcgcgccctg 

45 cgcgaggcat gggcccgcat cggcgaggtc ttcggcgtgg acgaggacaa gcgcggcgcc 960 

46 cgccagaatg cgatcaccag ctggcgtgac gtcacgcgcg aagaccccta tgtcaacatc 1020 

47 cttcgcggtt cgattgccac cttctccgcc tccgttggtg gggccgagtc gatcacgacg 1080 

48 ctgcccttca cccaggccct cggcctgccg gaggacgact tcccgctgcg catcgcgcgc 1140 

49 aacacgggca tcgtgctcgc cgaagaggtg aacatcggcc gcgtcaacga cccggccggt 1200 

50 ggctcctact acgtcgagtc gctcacccgc agcctggccg acgccgcctg gaaggaattc 1260 

51 caggaggtcg agaagctcgg tggcatgtcg aaggccgtca tgaccgagca cgtcaccaag 1320 

52 gtgctcgacg cctgcaatgc cgagcgcgcc aagcgcctgg ccaaccgcaa gcagccgatc 1380 

53 accgcggtca gcgagttccc gatgatcggg gcccgcagca tcgagaccaa gccgttcccc 1440 

54 gccgctccgg cgcgcaaggg cctggcctgg catcgcgact ccgaggtgtt cgagcagctg 1500 

55 atggatcgct ccaccagcgt ctccgagcgc cccaaggtgt tcctggcctg cttgggcacc 1560 
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cgccggcatc 1620 
caagaagtcg 1680 
gggacttgag 1740 
cgccttcaag 1800 
gtttatgggc 1860 
gaagtga 1917 



f reudenreichii ssp. sheritianii 



56 cgtcgcgact tcggtggccg cgagggcttc tcgagcccgg tgtggcacat 

57 gacaccccgc aggtcgaagg cggcaccacc gccgagatcg tcgaggcatt 

58 ggcgcccagg tggccgacct ctgctcgtcc gccaaggtct acgcgcagca 

59 gtcgccaagg cactcaaggc cgccggcgca aaggccctgt acctgtcggg 

60 gagttcggtg atgacgccgc cgaggccgag aagctgatcg acggacgcct 

61 atggatgtcg tcgacaccct gtcctccacc cttgatatct tgggagtcgc 

63 <210> SEQ ID NO: 2 

64 <211> LENGTH: 2187 

65 <212> TYPE: DNA 

66 <213> ORGANISM: Propionibacterium 

68 <4 00> SEQUENCE: 2 

69 gtgagcactc tgccccgttt tgattcagtt gacctcggca atgccccggt tcctgctgat 

70 gccgcacgac gcttcgagga actggccgcc aaggccggca ccggagaggc gtgggagacg 

71 gccgagcaga ttccggttgg caccctgttc aacgaagacg tctacaagga catggactgg 

72 ctggacacct acgcaggtat cccgccgttc gtccacggcc cgtatgcaac catgtacgcg 

73 ttccgtccct ggacgattcg ccagtacgcc ggtttctcca cggccaagga gtcgaacgcc 

74 ttctaccgcc gcaaccttgc ggccggccag aagggcctgt cggttgcctt cgacctgccc 

75 acccaccgtg gctacgactc ggacaatccc cgcgtcgccg gtgacgtcgg catggccggt 

7 6 gtggccatcg actccatcta tgacatgcgc gagctgttcg ccggcattcc gctggaccag 

77 atgagcgtgt ccatgaccat gaacggcgcc gtgctgccga tcctggccct ctatgtggtg 

78 accgccgagg agcagggcgt caagcccgag cagctcgccg ggacgatcca gaacgacatc 

79 ctcaaggagt tcatggttcg taacacctac atctacccgc cgcagccgag tatgcgaatc 

80 atctctgaga tcttcgccta cacgagtgcc aatatgccga agtggaattc gatttccatt 

81 tccggctacc acatgcagga agccggcgcc acggccgaca tcgagatggc ctataccctg 

82 gccgacggtg ttgactacat ccgcgccggc gagtcggtgg gcctcaatgt cgaccagttc 

83 gcgccgcgtc tgtccttctt ctggggcatc ggcatgaact tcttcatgga ggttgccaag 

84 ctgcgtgccg cgcgcatgtt gtgggccaag ctggtgcatc agttcgggcc gaagaacccg 

85 aagtcgatga gcctgcgcac ccactcgcag acctccggtt ggtcgctgac cgcccaggac 

8 6 gtctacaaca acgtcgtgcg tacctgcatc gaggccatgg ccgccaccca gggccatacc 

87 cagtcgctgc acacgaactc gctcgacgag gccatcgccc tgccgaccga tttcagcgcc 

88 cgcatcgccc gtaacaccca gctgttcctg cagcaggaat cgggcacgac gcgcgtgatc 
8 9 gacccgtgga gcggctcggc atacgtcgag gagctcacct gggacctggc ccgcaaggca 

90 tggggtcaca tccaggaggt cgagaaggtc ggcggcatgg ccaaggccat cgaaaagggc 

91 atccccaaga tgcgcatcga ggaagccgcc gcccgcaccc aggcacgcat cgactccggc 

92 cgccagccgc tgatcggcgt gaacaagtac cgcctggagc acgagccgcc gctcgatgtg 

93 ctcaaggtgg acaactccac ggtgctcgcc gagcagaagg ccaagctggt caagctgcgc 

94 gccgagcgcg atcccgagaa ggtcaaggcc gccctcgaca agatcacctg ggccgccggc 

95 aaccccgacg acaaggatcc ggatcgcaac ctgctgaagc tgtgcatcga cgctggccgc 

96 gccatggcga cggtcggcga gatgagcgac gcgctcgaga aggtcttcgg acgctacacc 
gcccagattc gcaccatctc cggtgtgtac tcgaaggaag tgaagaacac gcctgaggtt 
gaggaagcac gcgagctcgt tgaggaattc gagcaggccg agggccgtcg tcctcgcatc 

99 ctgctggcca agatgggcca ggacggtcac gaccgtggcc agaaggtcat cgccaccgcc 

100 tatgccgacc tcggtttcga cgtcgacgtg ggcccgctgt tccagacccc ggaggagacc 

101 gcacgtcagg ccgtcgaggc cgatgtgcac gtggtgggcg tttcgtcgct cgccggcggg 

102 catctgacgc tggttccggc cctgcgcaag gagctggaca agctcggacg tcccgacatc 

103 ctcatcaccg tgggcggcgt gatccctgag caggacttcg acgagctgcg taaggacggc 

104 gccgtggaga tctacacccc cggcaccgtc attccggagt cggcgatctc gctggtcaag 

105 aaactgcggg cttcgctcga tgcctag 
107 <210> SEQ ID NO: 3 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2187 
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108 <211> LENGTH: 1851 

109 <212> TYPE: DNA 

110 <213> ORGANISM: Streptomyces cinnamonensis 

112 <400> SEQUENCE: 3 

113 atgacggtcc tgcctgacga cgggctttcc ctggccgccg agttccctga 

114 gagcagtggc accgccttgt ggaaggcgtc gtgcgcaagt cgggtaagga 

115 acggccgcag aggaagcgct gtccaccacc ctcgaggacg ggctcaccac 

116 tacaccgcgc gcgacgccgc gcccgacgcc ggtttccccg gcttcgcccc 

117 ggttccgtcc ccgagggcaa caccccgggc ggctgggacg tccggcagcg 

118 gccgatcccg cgcgcaccaa tgaagcggtc ctcaccgatc tggagaacgg 

119 ctctggctga cgctgggctc cgccggtctt ccggtcaccg gtctggagcg 

120 ggcgtctacc tcgacctggt gcccgtcgcg ctcgacgcgg gcagcgaggc 

121 gcccgggagt tgctgcgcct gtacgaggcc gcgggcgtcg ccgacgacgc 

122 acgctcggcg ccgacccgct cggccacgag gcccgcaccg gggagaagag 

123 gcggcggtcg ccgaactggc ccggctgtgc ggggagcgct accccggtct 

124 accgtggacg cgctgccgta ccacgaggcg ggtgcctccg ccgcgcagga 

125 tcgctcgcca ccggtgtcga gtatctgcgc gccctgcacg acaagggtct 

126 aaggccttcg cgcagctgga gttcaggttc gcggccaccg ccgaccagtt 

127 gccaagctgc gcgccgcgcg ccgcctctgg gcgcgtgtcg ccgaggtgtc 

128 gccgccgggg cgcagcggca gcacgcggtg acctcgccgg tgatgatgac 

129 ccgtgggtga acatgctgcg gaccaccgtg gcgtgcctcg gcgcgggtgt 

130 gacgccgtca cggtcctgcc gttcgaccac gagctcgggc tgccggacgc 

131 cgcatcgccc gcaacacgtc gacgatcctc ctcgaggagt cgcacctcgc 

132 gacccggcgg gcggctcctg gtacgtggag cggctcaccg atgaactcgc 

133 tgggacttct tcaaggagat cgagcgcgcg gacggtcagg tcgcggcgct 

134 ctggtcggcg accggatcgc cgcgacctgg gccgagcgca ggaagaagct 

135 cgcgaaccga tcacgggtgt cagcgagttc ccgctgctca ccgagcgccc 

136 gagcccgcgc ccgccgcccc gcccggcggt ctgccccggg tgcgccgcga 

137 gaggagctgc gcggccgctc ggacgcgcac ctggaagcga ccggcgcccg 

138 ttcatcgccg cgctggggcc ggccgccgcg cacacggcgc gcgcgacgtt 

139 ctcttcatgg cgggcggcgt cgagccggtg cacgacccgg tgtcggtgga 

140 gccgccgagg cgttcgcggc gagcggcgcc accgtcgcgt gcctctgctc 

141 ctctacgccg agcaggcgga agcggtcgcg cgggccctga agtcggcggg 

142 gtgttcctcg cggggcgcgg ggagttcgcc gacatcgacg agtacgtctt 

143 gacgcggtcg cggtgctcac ctccaccctc gaccgcatgg gagtggcgta 

145 <210> SEQ ID NO: 4 

146 <211> LENGTH: 2202 

147 <212> TYPE: DNA 

148 <213> ORGANISM: Streptomyces cinnamonensis 

150 <400> SEQUENCE: 4 

151 atgcggatcc ccgaattcga cgacatcgaa ctgggcgcgg gcggcggccc 

152 gcggagcagt ggcgcgccgc cgtgaaggag tccgtcggca agtccgagtc 

153 tgggagacgc ccgagggcat cgcggtcaag ccgctgtaca cgggcgccga 

154 ctggacttcc tggagacgta cccgggtgtc gcgccgtatc tgcgcggccc 

155 atgtacgtga accagccgtg gacgatccgg cagtacgcgg gattctccac 

156 tccaacgcct tctaccgccg caacctcgcg gcaggccaga aggggctctc 

157 gacctgccca cgcaccgcgg gtacgacagc gaccacccgc gcgtcaccgg 

158 atggcgggcg tggccatcga ctccatctac gacatgcgtc agctcttcga 

159 ctggacaaga tgacggtgtc gatgacgatg aacggtgccg tgctgcccgt 



tgcgacccat 
cgtatcgggg 
ccgccccctg 
tttcgtcagg 
gtacgcgagc 
cgtcacctcg 
tgcgctcgac 
cgcgaccgcc 
ggtgcgcggc 
cacctccttc 
gcgcgcgctg 
gctcggcgcc 
cggtgtcgag 
cctcaccatc 
cggggtgccg 
ccgccgcgac 
gggcggtgcc 
gttcgcgcgc 
gcgtgtcatc 
ccacgcggcc 
ccgctccggc 
ggcccggcgc 
ggtcgagcgc 
cgaggcgtac 
cccgaaggtg 
cgccgccaac 
cgcggagacg 
cagcgacgtg 
cgcgctgcgg 
cgcgggctgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1851 



gtccggctcg 60 • 
cgacctcctg 120 
cgtcgagggc 180 
ctacccgacg 240 
cgccgaggag 300 
ggtcgccttc 360 
tgacgtcggc 420 
cggcattccg 480 
tctcgcgctg 540 



file://C:\CRF4\Outhold\VsrI699 1 36A.htm 



8/29/03 



RAW SEQUENCE LISTING DATE: 08/29/2003 

PATENT APPLICATION: US/09/699 , 136A TIME: 14:19:41 



Input Set : A:\010025-01 SEQ LISTING.DBF.TXT 
Output Set: N:\CRF4\08292003\l699136A.raw 



160 tacatcgtgg cggccgagga gcagggcgta ccgccggaga 

161 aacgacatcc tcaaggagtt catggtccgc aacacctaca 

162 atgcggatca tctccgacat cttcgcgtac acgtcgcaga 

163 atctcgatct ccggctatca catccaggag gcgggcgcga 

164 tacacgctcg ccgacggtgt ggagtacctg cgagccgggc 

165 gacgcgttcg cgccgcggct ctccttcttc tgggcgatcg 

166 gtcgccaagc tccgcgcggc gcgcctgctc tgggcgaagc 

167 aagaacgcca agtccctctc cctgcgcacc cattcgcaga 

168 gcgcaggacg tgttcaacaa cgtcacgcgc acgtgtgtcg 

169 ggccacacgc agtccctgca cacgaacgcc ctggacgagg 

170 ttctccgcgc ggatcgcccg caacacccag ctgctcatcc 

171 cggacgatcg acccgtgggg cggcagcgcg tacgtcgaga 

172 cgccgcgcct ggcagcacat cgaggaggtc gaggcggcgg 

173 gacgcgggca tcccgaagct gcgcgtcgag gaggccgcgg 

174 gactcggggc gccagccggt catcggcgtc aacaagtacc 

175 atcgacgtcc tgaaggtcga caactcctcg gtgcgcgccc 

176 cgcctgcgcg aggagcgtga cgacgccgcc tgccaggacg 

177 gcggccgagc gtggtcccgg ccagggactg gagggcaacc 

178 gcggcccggg cgaaggccac ggtcggtgag atctccgacg 

179 cggcacgcgg gccagatccg tacgatctcc ggtgtgtacc 

180 ccgagcgtgg agcgcacgcg tgccctggtg gacgcgttcg 

181 ccgcgcatcc tcgtcgcgaa gatgggtcag gacggccacg 

182 gcgagcgcct tcgccgacct gggcttcgac gtcgacgtcg 

183 gcggaggtcg cgcgccaggc cgtcgaggcg gacgtgcaca 

184 gccgcagggc acctcaccct cgtaccggca ctgcgcgagg 

185 gacgacatca tgatcgtcgt gggcggcgtc atcccgccgc 
18 6 gaggcgggcg ccacggcggt gttcccgccc ggcacggtga 
187 ctggtgaagc gtctggcggc cgacctcggc cacgaactgt 

189 <210> SEQ ID NO: 5 

190 <211> LENGTH: 447 

191 <212> TYPE: DNA 

192 <213> ORGANISM: Propionibacterium freudenrei 

194 <400> SEQUENCE: 5 

195 atgagtaatg aggatctttt catctgtatc gatcacgtgg 

196 gacgaggctt ccaagtacta ccaggagacc ttcggctggc 

197 aacccggagc agggagtcgt cgagatcatg atggccccgg 

198 atgacccagg ttcaggtcat ggccccgctc aacgacgagt 

199 gccaagcaca atggtcgcgc cggactgcac cacatggcat 

200 gccgtcagcg ccaccctgcg cgagcgcggc gtgcagctgc 

201 ggcaccggcg gcaaccgcat caacttcatg catcccaagt 

202 gagctcaccc agtacccgaa gaactga 

204 <210> SEQ ID NO: 6 

205 <211> LENGTH: 148 

206 <212> TYPE: PRT 

207 <213> ORGANISM: Propionibacterium freudenrei 

209 <400> SEQUENCE: 6 

210 Met Ser Asn Glu Asp Leu Phe lie Cys lie Asp 

211 1 " 5 10 

212 Cys Pro Asp Ala Asp Glu' Ala Ser Lys Tyr Tyr 



agctggcggg 
tctatccgcc 
agatgccgcg 
cggccgacct 
aggaggcggg 
gcatgaactt 
tcgtgaagca 
catcgggctg 
aggcgatggc 
cgctcgccct 
agcaggagtc 
agctgacgta 
gcggcatggc 
cgcgcaccca 
gggtggacac 
agcagatcga 
cgctgcgcgc 
tgctcgcgct 
cactggagag 
gcaccgaagc 
acgaggccga 
accgcggtca 
gcccgctgtt 
tcgtcggcgt 
agctggccgc 
aggacgtcga 
tcccggacgc 
ga 



gaccattcag 
gaagccctcg 
ctacaactcc 
ggagctggcg 
cctggacgtg 
cttcatggag 
gttcgacccg 
gtcgctgacc 
ggcgacgcag 
gccgaccgac 
ggggacgacg 
cgacctggcg 
gcaggccatc 
ggcgcgcatc 
cgacgagcag 
gaagctgcgg 
cctgacggcg 
cgcggtcgac 
cgtgtacggg 
aggccagtcg 
ggggcgcagg 
gaaggtgatc 
ccagacgccg 
ctcctcgctt 
ggagggccgc 
ggccctgcac 
ggcgcacgac 



chii ssp. shermanii 



catatgcgtg 
atgagctcca 
ctgcgaagct 
cgaccgttgc 
ggcgtgtcga 
tgtatgacga 
cgggcaaggg 



ccccgacgcc 
ccgcgaggag 
gaccgagcac 
caagtggctt 
tgacatcgac 
gcccaagctc 
cgtgctcatc 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2202 



60 

120 

180 

240 

300 

360 

420 

447 



chii ssp. shermanii 

His Val Ala Tyr Ala 
15 

Gin Glu Thr Phe Gly 
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